In the Title 

Please replace the title with the following: 
A PROCESS AND SYSTEM FOR ADAPTIVE AND PROGRESSIVE AUDIO STREAM 
DESCRAMBLDSfG OF AUDIO STREAMS 
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In the Specification 
Please replace paragraphs [0001] through [0013] with the following: 
Related Application 

This is a §371 of International Application No. PCT/FR2003/050098, with an international 
filing date of October 21, 2003 (WO 2004/039075, published May 6, 2004). which is based on 
French Patent Application No. 02/13090, filed October 21, 2002. 
Field of the Invention 

Th e pr e s e nt This invention relates to th e art of th e processing efdigital audio streamsT 

Th e present inv e ntion propos e s supplyin g , particularly to a system permitting the-auditory 
scrambling and recomposing of digital audio content. 

Th e pr e s e nt inv e ntion r e lat e s mor e particularly to a d e vice capabl e of transmitting in a s e cur e 
manner a s e t of audio str e ams with a high auditory quality to a musical or sp ee ch play e r in ord e r to 
b e r e corded in th e m e mory or on th e hard disk of a s e t top d e cod e r box connecting th e transmission 
n e twork to th e audio play e r whil e pr e s e rving th e auditory quality but avoiding any fraudul e nt us e 
such as th e possibility of making pirat e d copi e s of audio programs r e cord e d in the m e mory or on th e 
hard disk of th e set top d e cod e r box. 

Th e inv e ntion conc e rns a proc e ss for th e distribution of digital audio s e qu e nc e s according to 
a nominal str e am format constitut e d by a succession of fram e s, e ach comprising at l e ast on e digital 
audio block grouping a c e rtain numb e r of coeffici e nts corr e sponding to simple audio e l e m e nts cod e d 
digitally according to a manner sp e cifi e d in th e str e am conc e rn e d and us e d by all audio d e cod e rs 
capabl e of playing it in ord e r to b e abl e to corr e ctly d e cod e it. This process compris e s: 

A pr e paratory stage consisting in modifying at l e ast one of thes e co e ffici e nts, 

A transmission stag e 
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Of a main str e am in conformity with th e nominal format constitut e d by fram e s 

containing th e blocks modified in th e cours e of th e pr e paratory stag e and 

By a path, s e parat e from this main str e am, of compl e m e ntary digital information 

allowing th e r e constitution of the original str e am from th e computation on th e targ e t e quipment as a 
function of th e main str e am and of the complem e ntary information. — This compl e m e ntary 
information is d e fin e d as a s e t constitut e d by data ( e .g., co e ffici e nts d e scribing th e original data 
stream or e xtracts of th e original str e am) and by functions ( e .g., th e substitution or int e rchanging 
function). A function is d e fin e d as containing at l e ast on e instruction putting data and op e rators in a 
r e lationship. This compl e m e ntary digital information d e scribes th e op e rations to b e carri e d out for 
r e cov e ring th e digital str e am from th e modifi e d str e am. 

Th e r e constitution of th e original str e am is carri e d out on th e targ e t e quipment from th e 
modifi e d main str e am alr e ady pr e s e nt or s e nt in r e al tim e on th e targ e t e quipm e nt and from th e 
compl e mentary information s e nt in r e al tim e comprising data and functions e x e cut e d with th e aid of 
digital routin e s (s e t of instructions). 
Background 

Th e prior art alr e ady knows WO 00/58963 (Liquid Audio) discloses a security system for 
portable music players from int e rnational pat e nt application WO 0058963 (Liquid Audio) . Data 
such as a musical track is saved as a secure portable track (SPT) that can be linked to one or several 
players and can be linked to a particular saving means, thus restricting the reading of the SPT to 
specific players and ensuring that the reading is carried out only from the original saving means. The 
SPT is linked to a player by the encryption of data of the SPT using a save key that is unique to the 
player, difficult to change and is guarded by the player under strict security conditions. The SPT is 
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linked to a particular means of saving including data uniquely identifying the save means in a form 
resistant to falsification, that is, signed in an encrypted manner. 

A system for scrambling audio signals is also known from US pat e nt 4,600,941 (Sony) 
discloses a system for scrambling audio signals in which an audio signal is divided into blocks, each 
of which is formed by a plurality of frames, which plurality of frames is rearranged on a time base in 
an order predetermined for each block in such a manner as to be encoded, and the encoded signal is 
rearranged on a time base in an original order in such a manner as to be decoded. This system 
comprises a first circuit for processing the signal in ord e r to insert a redundant portion into a portion 
between contiguous frames and to compress the frames in base time in response to the redundant 
portions during the encoding, comprises a circuit generating a signal for inserting a control signal 
other than audio information in the redundant portions, a circuit for detecting the control signal for 
detecting the control signal during the decoding and a second circuit for processing the signal for 
removing the redundant portions in synchronism with the detected control signal and decompressing 
the frames in base time in response to the redundant portions. 

A method and a syst e m for scrambling and d e scrambling audio information signals is also 
known from U S pateft^5,058,159 (Macrovision eCorporation) discloses a method and a system for 
scrambling and descrambling audio information signals . The audio signals are scrambled by 
inverting the original frequency spectrum in such a manner that the frequency portions that are 
originally at the bottom in the audio frequency band are shifted to the top whereas the portions 
originally at the top of the band are shifted to the bottom. A pilot sound of a known frequency is 
recorded with the audio signals of the shifted frequencies. During the reproduction each variation in 
phase and in frequency is searched by its pilot that is used to generate the modulation signal for 
reconstituting the original content in audio signal frequencies. 
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International patent application WO 99/55089 "Multimedia Adaptive Scrambling System" 
also t e aches discloses a system for scrambling digital samples representing multimedia data (audio 
and video) in such a manner that the content of the samples is degraded but recognizable or 
otherwise supplied with the required quality. The level of quality is linked to an associated 
signal/noise ratio and is determined with the aid of objective and subjective tests. A given number of 
LSB's (least significant bits) is scrambled frame by frame in an adaptive manner as a function of the 
dynamics of the possible values. All the encryption keys are included in the audio/video stream and 
used by the decoder for descrambling and restoring the stream. After the descrambling the 
encryption key cannot be recovered because it is scrambled itself by the decoder. 

The stat e of the art giv e s evid e nc e o f discloses many systems for the protection of audio 
streams based substantially on the encryption of data adding encryption keys independent of the 
content of the audio stream and which therefore modify the format of the structured stream. One 
particular and different realization is that of the Coding Technologies company, that consists inof 
protecting by scrambling a selected part of the bitstream ("bitstream" refers to the binary stream at 
the output of the audio encoder) and not the entire bitstream. The protected parts represent the 
spectral values of the audio signal with the result that during the decoding without decryption the 
audio stream is distorted and disagreeable to the ear. 

Th e pr e s e nt inv e ntion has th e problem of e liminating the disadvantages of the prior art by 
proposin g lt would therefore be advantageous to provide an adaptive and progressive system for 
descrambling the content played as a function of the profile and of the rights of the client. 
Summary of the Invention 

This invention relates to a process for distributing digital audio sequences in streams 
including data sequences containing digital audio blocks including modifying at least a part of the 
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data sequences in an original stream to produce a modified stream in a same nominal format as the 
original stream, transmitting the modified stream, and reconstructing the original stream with the aid 
of a decoder, wherein reconstruction of the original stream is adaptive and progressive as a function 
of information coming from a digital profile of a target. 

This invention also relates to the process wherein the modification produces a modified main 
stream and complementary information that permits reconstruction of the original stream by a 
decoder by transmitting the modified stream and transmitting to target equipment a subpart of the 
complementary modification information, which subpart is determined as a function of information 
coming from a data profile of the target. 
Brief Description of the Drawing 

The invention will be better understood with the aid of the following description made purely 
by way of explanation of an embodiment of the invention with reference made to the Drawing, which 
shows a particular embodiment of the client-server system in accordance with aspects of the 
invention. 

Detailed Description 

The invention relates to a device capable of transmitting in a secure manner a set of audio 
streams with a high auditory quality to a musical or speech player to be recorded in the memory or on 
the hard disk of a set-top decoder box connecting the ransmission network to the audio player while 
preserving the auditory quality, but avoiding any fraudulent use such as the possibility of making 
pirated copies of audio programs recorded in the memory or on the hard disk of the set-top decoder 
box. 

The invention concerns a process for the distribution of digital audio sequences according to 
a nominal stream format constituted of a succession of frames, each comprising at least one digital 
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audio block grouping a certain number of coefficients corresponding to simple audio elements coded 
digitally according to a manner specified in the stream concerned and used by all audio decoders 
capable of playing it in order to be able to correctly decode it. This process comprises: 

a preparatory stage consisting of modifying at least one of these coefficients, 
a transmission stage of a main stream in conformity with the nominal format 
constituted of frames containing the blocks modified in the course of the preparatory stage 
and 

by a path, separate from this main stream, of complementary digital information 
allowing reconstitution of the original stream from the computation on the target equipment 
as a function of the mains stream and of the complementary information. This 
complementary information is defined as a set constituted of data (e.g., coefficients 
describing the original data stream or extracts of the original stream) and functions (e.g., the 
substitution or interchanging function). A "function" is defined as containing at least one 
instruction putting data and operators in a relationship. This complementary digital 
information describes the operations to be carried out for recovering the digital stream from 
the modified stream. 

Reconstruction of the original stream is carried out on the target equipment from the 
modified main stream already present or sent in real time on the target equipment and from the 
complementary information sent in real time comprising data and functions executed with the aid of 
digital routines (set of instructions). 

In th e pr e s e nt inv e ntion th e The term "scrambling" hereinafter denotes the-modification of a 
digital audio stream by appropriate methods in such a manner that that this stream remains in 
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conformity with the norm or standard with which it was digitally encoded while rendering it audible 
by an audio reader (or player) but altered as concerns human auditory perception. 

In th e pr e s e nt inv e ntion the The term "descrambling" hereinafter denotes the process of 
restoration by appropriate methods of the initial stream and the restored audio stream is identical 
after the descrambling to the original initial audio stream. Th e r e constitution R econstitution of the 
original stream is carried out on the target equipment from the modified main stream already present 
or sent in real time on the target equipment and from the complementary information sent in real 
time comprising data and functions executed with the aid of digital routines (set of instructions). 
The entirety or a subpart of the complementary information is sent as a function of the profile and of 
the rights of the client. 

Please replace paragraphs [0015] through [0016] with the following: 

The type of information contained in this subpart corresponds to a level of scalability 
determined as a function of the profile of the target. The nature of the data and/or functions 
belonging to the complementary information sent to the target during the connection is defined as the 
type. For example, the type of data is relative to the habits of the target (connection time, duration of 
the connection, regularity of the connection and of payments), te-histhe environment (lives in a big 
city, the time at the present moment) and to his characteristics (age, sex, religion, community). 

This complementary information is composed at least byof functions that are personalized for 
each target relative to the connection session. A session is defined starting from the connection time, 
the duration, the type of saidthe modified stream listened to and the connected elements (targets, 
servers). 
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Please replace paragraph [0018] with the following: 

The term "profile" of the user denotes a data file comprising descriptors and information 
specific to the user, e.g. his-cultural preferences and his-social and cultural characteristics, his-habits 
of use such as the frequency of using audio means, the average listening time of a scrambled audio 
sequence, the frequency of listening to a scrambled sequence, the price the user is ready to pay or any 
other behavioral characteristic regarding the use of audio sequences. This profile is formalized by a 
data file or a data table that can be used by computer means. 

Please replace paragraphs [0020] through [0021] with the following: 

Th e pres e nt inv e ntion has th e probl e m of e liminating th e disadvantag e s of th e prior art by 
proposing an adaptiv e and progr e ssiv e syst e m for d e scrambling th e cont e nt play e d as a function of 
th e profil e and of th e rights of th e client. 

In th e pr e s e nt inv e ntion an An adaptive and progressive descrambling of the content listened 
to is applied as a function of the profile and of th e rights of each user. The server sends only the 
subparts of said-complementary information, that has a structure characterized by a "granular 
scalability" for supplying the target with a more or less scrambled content as a function of certain 
criteria, profiles and rights. Th e notion of "scalabilit e [Fr e nch]" is d e fin e d from th e English word 
"scalability", which "Scalabilitv" characterizes an encoder capable of encoding or a decoder capable 
of decoding an ordered set of binary streams in such a manner as to produce or reconstitute a 
multilayer sequence. Granularity is defined as the quantity of information that can be transmitted per 
layer of a system characterized by any scalability, which system is then also granular. The granularity 
is relative to the degree of scrambling. The audio stream is completely scrambled once for all 
targets. Then, the server sends all or part of this complementary information in such a manner that 
the stream is played more or less scrambled by each of the targets. The sent content of this 
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complementary information and the content played on the client player are a function of each client 
and the server manages and carries out the sending in real time at the moment of listening for each 
listener. 

Please replace paragraphs [0024] through [0029] with the following: 
According to a variant one aspect, the modified main stream is recorded on the target 
equipment prior to the transmission of the complementary information on the target equipment. 
According to a varian t another aspect, the modified main stream is recorded on a physical support m 
ord e r to be transmitted to the target equipment prior to the transmission of the complementary 
information on the target equipment. According to yet another variaftt aspect, the modified main 
stream and the complementary information are transmitted together in real time at the moment of 
listening. 

The determination of this subpart is advantageously realized by a method of granular 
scalability and the quantity of information contained in this subpart corresponds to a level of 
scalability determined as a function of the target profile. According to a variant one aspect, the type 
of information contained in this subpart corresponds to a level of scalability determined as a function 
of the target profile. According to a particular r e alizatio n aspect, this complementary modification 
information comprises at least one digital routine suitable for executing a function. 
Please replace paragraphs [0031] through [0044] with the following: 
This complementary information is advantageously subdivided into at least two subparts. 
According to a variant one aspect, these subparts of the complementary information are distributed by 
different media. According to another varian taspect, these subparts of the complementary 
information are distributed by the same media. According to a particular r e alization aspect the 
complementary information is transmitted on a physical vector. 
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According to a variant one aspect, the complementary information is transmitted online. 

TheseThe digital sequences are advantageously in conformity with a given norm or standard. 

At least a part of saidthe client profile is preferably stored on equipment of the target. 

The type of information contained in saidthe subpart is advantageously updated as a function 
of the behavior of saidthe target during the connection to the server or as a function of his-habits or 
as a function of data communicated by a third party. 

According to a variant one aspect, the process comprises a prior analog/digital conversion 
stage with a structured format, which process is applied to an analog audio signal. 

The pr e s e nt invention also relates to a system for the distribution of digital audio sequences 
comprising an audio server comprising means for broadcasting a stream modified in conformity with 
the previously described process and a plurality of pieces of equipment provided with a descrambling 
circuit, characterized in that the server also comprises means for recording the digital profile of each 
target and means for analyzing the profile of each of the targets of a modified stream, which means 
orders the nature of the complementary information transmitted to each of these analyzed targets. 

According to a variant one aspect, the level (quality, quantity, type) of complementary 
information is determined for each target as a function of the state of its profile at the moment the 
main stream is listened to. 

Th e inv e ntion will b e b e tt e r und e rstood with th e aid of th e following d e scription made pur e ly 
by way of e xplanation of an e mbodim e nt of the inv e ntion with ref e r e nce made to th e attach e d figur e : 
Figur e 1 shows a particular e mbodim e nt of th e cli e nt s e rv e r syst e m in accordanc e with th e 
inv e ntion. 

A digital audio stream is generally constituted byof sequences constituted byof blocks or 
frames organized according to a specific digital format for each audio coder. The AC - 12 - 
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(advanced coding) Dolby coder performs the transformation of the time - frequency audio signal and 
the spectral envelope is represented in the form of exponents. A special procedure determines how 
many bits are to be allocated for the representation of the mantissas, that are quantified as a 
consequence, knowing the arrangement of these elements in the bitstream constituted byof several 
audio blocks containing information about the dithering (digital treatment whose goal is to obtain a 
better approximation of a digital audio signal by adding a low-amplitude random signal), the 
coupling, exponents, allocation of the bits, the mantissas. The values of the exponents are coded in 
differential and by modifying these values very little the entire block can be corrupted and 
consequently the following blocks. 

Our invention can eensistcomprise, e.g., in a non-limiting manner^ inof modifying the value 
of certain fields for an AC-3 stream, especially, e.g., the values of exponents and of mantissas 
whether for one or several blocks or any other elements of the stream structured in such a manner as 
to obtain an AC-3 stream that is perfectly in conformity^ but whose auditory quality is degraded and 
to store in complementary information organized in different layers of scalability the information 
necessary for a decoder for reconstituting the parts of the original stream or the integrality of the 
stream. When the server decides not to totally descramble the stream to be heard for a given target 
or when the rights of a user are insufficient for the server to send him the entire complementary 
information, the server can, e.g., restore only the true values of certain exponents and mantissas in 
such a manner that the audio stream is more or less descrambled but not the rest of the modified 
information. 

Please replace paragraphs [0047] through [0055] with the following: 
Our invention can eensis tcomprise , e.g., in a non-limiting manner^ inof modifying the value 
of certain fields for an MPEG-AAC stream, in particular, e.g., the values of x[g] [win] [sfb] [bin], 
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global_gain, scale_factor_data, the subscripts of the LSP index_lsp [], or interchange the spectral 
values [w] [x] [y] [z] whether for one or for several blocks or any other elements of the stream 
structured in such a manner as to obtain an MPEG-AAC stream that is perfectly in conformity but 
whose auditory equality is degraded and to store in complementary information organized in 
different layers of scalability the information necessary for a decoder to reconstitute the parts of the 
original stream or the integrality of the stream. When the server decides to not totally descramble the 
stream to be listened to for a given target or when the rights of the user are insufficient for the server 
to send him the totality of the complementary information, the server can e.g., restore only the true 
values of certain values of global_gain and of the subscripts LSP index_lsp [] in such a manner that 
the audio stream is more or less scrambled but not the rest of the modified information. 

In th e attach e d drawing figur e 1 shows a pr e f e rr e d particular e mbodiment of th e cli e nt s e rv e r 
syst e m in conformity with the invention. 

Th eTurning now to the Drawing, the original stream 101 can be directly in digital form 1 1 1 
or in analog form 11. In the latter instance analog stream 1 1 is converted by a coder (not shown) into 
digital format 111. In the remainder of the text we wiH-take note 1 of the input digital audio stream. 
The MPEG-AAC stream that is to be secured 1 is passed to an analysis and descrambling system 121 
that will-generates modified main stream 122 in the MPEG-AAC format identical to input stream 1 
except that certain coefficients have been replaced by values different from the original ones and is 
placed in output buffer memory 122. Complementary information 123 of any format contains 
information relative to the elements of the audio blocks that were modified, replaced, substituted or 
moved, and their value or emplacement in the original stream. 

The stream in MPEG-AAC format 122 is then transmitted either in physical form on a CD- 
ROM, a non- volatile memory, DVD, etc. or via a transmission network 4 of the following types: 
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Telephone network, DSL (digital subscriber line), BLR (local radio loop), DAB (digital audio 
broadcasting), RTC (commutated telephone network), digital mobiles (GSM, GPRS, UMTS), 
microwave, cable, satellite, e.g., to the terminal of the spectator 8 and more precisely into his 
memory or onto his-hard disk 85. When target 8 requests to hear the audio sequence present in his 
memory or on his-hard disk 85, two possibilities are possible: either the spectator 8 does not have 
the rights necessary to listen to the sequence. In this case MPEG-AAC stream 122 generated by 
scrambling system 121 present in memory 85 is passed to synthesis system 85 via a reading buffer 
memory 83 that does not modify it and transmits it identically to a classic audio MPEG-AAC player 
81 and its content, heavily degraded auditorily by scrambling system 121, is played on listening 
device 6. 

Or the server decides that user 8 has the rights to hear the audio sequence, which can be 
tested, e.g., with the aid of a system based on a smart card 82 connected to synthesis system 87. In 
this case the synthesis system makes a listening request to server 12 containing the information 
necessary 123 for recovering the original audio stream 101 . Server 12 then sends the complementary 
information via telecommunication networks of the following types: Analog or digital telephone 
line, DSL (digital subscriber line) or BLR (local radio loop), via the networks DAB (digital audio 
broadcasting), RTC, (commutated telephone network), or via digital telecommunication networks 
(GSM, GPRS, UMTS) 5, which complementary information allows the reconstitution of the audio 
stream 123 in such a manner that the target 8 can store it in buffer memory 86. Synthesis system 87 
then proceeds to the restoration, in the scrambled MPEG-AAC stream which it reads in its reading 
buffer memory 83, of the modified fields whose positions it knows as well as the original values by 
virtue of the content of the complementary information read in buffer memory 86 for descrambling 
the audio. The amount of information contained in complementary information 123 that is sent to 
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the descrambling system is specific, adaptive and progressive for each target and depends on his 
rights, e.g., single or multiple use, right to make one or several private copies, delayed or advance 
payment. 

The level (quality, quantity, type) of complementary information is determined as a function 
of each target, as a function of the state of his-profile at the moment of the transmission of the 
complementary stream and at least a part of this profile is stored on target equipment. For example, 
in figur e l the Drawing, part of the user profile is recorded on smart card 82 connected to synthesis 
system 87 as, e.g., the frequency of the connections or the regularity of payments. The remainder of 
the profile can be on the server as, e.g., the type of music or audio clips that the spectator prefers. 

Another e mbodim e nt aspect is the updating of the target profile, which also depends on the 
connection time to the server (referring to the behavior) in order to know if the client connects 
regularly (reference to his habits) or updating as a function of recovered data close to a consumer 
database already existing on a server and relative to this client. 

Another e mbodim e nt consists in that aspect is where the server transmits all the 
complementary information to the target during the first minutes of listening to the audio sequence 
then, in the course of time, transmits less and less complimentary information to the target in such a 
manner as to descramble the main stream less and less, thus producing the effect for the target that 
the sound coming from the headset or the loudspeakers becomes more and more scrambled. This 
functionality can encourage the target to purchase the rights for the sequence played. 

Another e mbodiment consists in that aspect is where all or part of complementary information 
123 is transmitted to the target on a physical vector such as a memory card or a smart card 82. 
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